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Abstract. We consider constraint satisfaction problems parameterized above or 
below tight bounds. One example is MaxSat parameterized above m/2: given 
a CNF formula F with m clauses, decide whether there is a truth assignment 
that satisfies at least m/2 + k clauses, where k is the parameter. Among other 
problems we deal with are MaxLin2- AA (given a system of linear equations over 
F2 in which each equation has a positive integral weight, decide whether there 
is an assignment to the variables that satisfies equations of total weight at least 
W/2 + k, where W is the total weight of all equations), Max-r-Lin2-AA (the 
same as MaxLin2-AA, but each equation has at most r variables, where r is a 
constant) and Max-r-Sat-AA (given a CNF formula F with m clauses in which 
each clause has at most r literals, decide whether there is a truth assignment 
satisfying at least YliLi (1 — 2 r ') + k clauses, where k is the parameter, n is the 
number of literals in Clause i, and r is a constant). We also consider Max-r-CSP- 
AA, a natural generalization of both Max-r-Lin2-AA and Max-r-Sat-AA, order 
(or, permutation) constraint satisfaction problems of arities 2 and 3 parameterized 
above the average value and some other problems related to MaxSat. We discuss 
results, both polynomial kernels and parameterized algorithms, obtained for the 
problems mainly in the last few years as well as some open questions. 



1 Introduction 

This paper surveys mainly recent results in a subarea of parameterized algo- 
rithms and complexity that was launched quite early in the short history of pa- 
rameterized algorithms and complexity, namely, in the Year 2 BDlfl 

Consider the well-known problem MaxSat, where for a given CNF for- 
mula F with m clauses, we are asked to determine the maximum number of 
clauses of F that can be satisfied by a truth assignment. It is well-known (and 
shown below, in Section H]) that there exists a truth assignment to the variables 
of F which satisfies at least m/2 clauses. 



1 BDF stands for Before Downey-Fellows, i.e., before 1999 when the first monograph describ- 
ing foundations of parameterized algorithms and complexity was published 1 18|. 



The standard parametrization /c-MaxSat of MaxSat is as follows: de- 
cide whether there is a truth assignment which satisfies at least k clauses of F, 
where k is the parameter. (We provide basic terminology and notation on pa- 
rameterized algorithms and complexity in Section [2) It is very easy to see that 
/c-MaxS at has a kernel with a linear number of variables. Indeed, consider an 
instance / of A;-MaxSat. If k < m/2 then I is a YES-instance. Otherwise, we 
have k > m/2 and m < 2k — 1. Suppose that we managed somehow to obtain 
a better result, a kernel with at most pk variables, where 1 < p < 2. Is such a 
kernel of any interest? Such a kernel would be of interest only for k > m/2, i.e., 
when the size of the kernel would be bounded by pk > pm/2. Thus, such a ker- 
nel should be viewed as huge rather than small as the bound pk might suggest 
at the first glance. 

The bound m/2 is tight as we can satisfy only half clauses in the instances 
consisting of pairs (x), (x) of clauses. This suggest the following parameteri- 
zation of MaxSat above tight bound introduced by Mahajan and Raman l44l : 
decide whether there is a truth assignment which satisfies at least m/2 + k 
clauses of F, where k is the parameter. 

To the best our knowledge, PHI was the first paper on problems parameter- 
ized above or below tight bounds and remained the only one for several years, 
at least for constraint satisfaction problems (CSPs). However, in the last few 
years the study of CSPs parameterized above or below tight bounds has finally 
picked up. This is, in large part, due to emergence of new probabilistic and 
linear-algebraic methods and approaches in the area. 

In this survey paper, we will overview several results on CSPs parameterized 
above or below tight bounds, as well as some methods used to obtain these 
results. While not going into details of the proofs, we will discuss some ideas 
behind the proofs. We will also consider several open problems in the area. 

In the remainder of this section we give a brief overview of the paper and its 
organization. 

In the next section we provide basics on parameterized algorithms and com- 
plexity. The notions mentioned there are all well-known apart from a recent 
notion of a bikernel introduced by Alon et al. [2j. In Section [3l we describe 
some probabilistic and Harmonic Analysis tools. These tools are, in particular, 
used in the recently introduced Strictly-Above-Below-Expectation method [30]. 

Results on MaxSat parameterized above or below tight bounds are dis- 
cussed in Section 01 We will consider the above-mentioned parameterization of 
MaxSat above tight bound, some "stronger" parameterizations of MaxSat 
introduced or inspired by Mahajan and Raman PHI . The stronger parameteriza- 
tions are based on the notion of a t-satisfiable CNF formula (a formula in which 
each set of t clauses can be satisfied by a truth assignment) and asymptotically 
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tight lower bounds on the maximum number of clauses of a t-satisfiable CNF 
formula satisfied by a truth assignment for t = 2 and 3. We will describe linear- 
variable kernels obtained for both t = 2 and 3. 

We will also consider the parameterization of 2-S AT below the upper bound 
m, the number of clauses. This problem was proved to be fixed-parameter tractable 
by Razgon and O'Sullivan [52]. Raman et al. [51] and Cygan et al. [17] designed 
faster parameterized algorithms for the problem. The problem has several appli- 
cation, which we will briefly overview. 

Boolean Maximum r-CSPs parameterized above the average value are con- 
sidered in Section[5] where r is a positive integral constant. In general, the Max- 
imum r-CSP is given by a set V of n variables and a set of m Boolean formulas; 
each formula is assigned an integral positive weight and contains at most r vari- 
ables from V. The aim is to find a truth assignment which maximizes the weight 
of satisfied formulas. Averaging over all truth assignments, we can find the av- 
erage value A of the weight of satisfied formulas. It is easy to show that we can 
always find a truth assignment to the variables of V which satisfied formulas of 
total weight at least A. Thus, a natural parameterized problem is whether there 
exists a truth assignment that satisfies formulas of total weight at least A + k, 
where k is the parameter (k is a nonnegative integer). We denote such a problem 
by MAX-r-CSP-AA. 

The problem MAX-r-LlN2-AA is a special case of MAX-r-CSP-AA when 
every formula is a linear equation over F2 with at most r variables. For MAX-r- 
LIN2-AA, we have A = W/2, where W is the total weight of all equations. It is 
well-known that, in polynomial time, we can find an assignment to the variables 
that satisfies equations of total weight at least W/2, but, for any e > it is 
NP-hard to decide whether there is an assignment satisfying equations of total 
weight at least W(l + e)/2 ll33l . We give proof schemes of a result by Gutin 
et al. [30 ] that MAX-r-LlN2-AA has a kernel of quadratic size and a result of 
Crowston, Fellows et al. lfl2l that MAX-r-LlN2-AA has a kernel with at most 
(2k — l)r variables. The latest result improves that of Kim and Williams |39l 
that MAX-r-LlN2-AA has a kernel with at most r(r + l)k variables. Papers 
II12I391I imply an algorithm of runtime 2°^ + for MAX-r-LlN2-AA. 

We give a proof scheme of a result by Alon et al. [2] that MAX-r-CSP- 
AA has a a kernel of polynomial size. The main idea of the proof is to reduce 
MAX-r-CSP-AA to MAX-r-LlN2-AA and use results on MAX-r-LlN2-AA 
and a lemma on bikernels given in the next section. The result of Alon et al. O 
solves an open question of Mahajan, Raman and Sikdar fl3l not only for MAX- 
r-SAT-AA but for the more general problem MAX-r-CSP-AA. The problem 
MAX-r-S AT-AA is a special case of MAX-r-CSP-AA when every formula is a 
clause with at most r variables. For MAX-r-SAT-AA, the reduction to MAX-r- 
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LIN2-AA can be complemented by a reduction from MAX-r-LlN2-AA back to 
MAX-r-SAT-AA, which yields a kernel of quadratic size. (Note that while the 
size of the kernel for MAX-r-CSP-AA is polynomial we are unable to bound 
the degree of the polynomial.) 

MaxLin2-AA is the same problem as MAX-r-LlN2-AA, but the number 
of variables in an equation is not bounded. Thus, MaxLin2-AA is a general- 
ization of MAX-r-LlN2-AA. Section [6] presents a scheme of a recent proof by 
Crowston, Fellows et al. lfl2l that MaxLin2-AA is fixed-parameter tractable 
and has a kernel with polynomial number of variables. This result finally solved 
an open question of Mahajan, Raman and Sikdar B31 . Still, we do not know 
whether MaxLin2-AA has a kernel of polynomial size and we present only 
partial results on the topic. Max-Sat- AA is the same problem as MAX-r-SAT- 
AA, but the number of variables in a clause is not bounded. Crowston et al. lfI3i 
proved that MAX-S AT-AA is para-NP-complete and, thus, MaxSat-AA is not 
fixed-parameter tractable unless P=NP. We give a short discussion of this result 
in the end of Section [6] 

In Section |7] we discuss parameterizations above average of Ordering CSPs 
of arities 2 and 3. It turns out that for our parameterization the most important 
Ordering CSP is the problem r-LlNEAR ORDERING (r > 2). An instance of 
r-LiNEAR Ordering consists of a set V of variables and a multiset C of con- 
straints, which are ordered r-tuples of distinct variables of V (note that the same 
set of r variables may appear in several different constraints). The objective is to 
find an ordering a of V that maximizes the number of constraints whose order 
in a follows that of the constraint (such constraints are satisfied by a). 

It is easy to see that |C|/r! is the average number of constraints satisfied by 
an ordering of V and that it is a tight lower bound on the maximum number of 
constraints satisfied by an ordering of V. The only nontrivial Ordering CSP of 
arity 2 is 2-LlNEAR ORDERING. For this problem, Guruswami, Manokaran and 
Raghavendra [26 ] proved that it is impossible to find, in polynomial time, an or- 
dering that satisfies at least \C\ (l + e)/2 constraints for every e > provided the 
Unique Games Conjecture (UGC) of Khot l38l holds. Similar approximation 
resistant results were proved for all Ordering CSPs of arity 3 by Charikar, Gu- 
ruswami and Manokaran [8 ] and for Ordering CSPs of any arity by Guruswami 
etal. Il25l . 

In the problem r-LlNEAR ORDERING parameterized above average (r-LlNEAR 
Ordering- AA), given an instance of t-Linear Ordering with a multiset C 
of constraints, we are to decide whether there is an ordering satisfying at least 
|C|/r! + k constraints, where k is the parameter. Gutin et al. [30] proved that 
2-LlNEAR ORDERING-AA is fixed-parameter tractable and, moreover, has a 
kernel of a quadratic size. BETWEENNESS is an Ordering CSP of arity 3, which 
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is formulated in Section [7] Gutin et al. ||29l solved an open question of Benny 
Chor stated in Niedermeier's monograph [48] by showing that BETWEENNESS 
parameterized above average is fixed-parameter tractable and, moreover, has a 
kernel of a quadratic size. 

A simple, yet important, observation is that all Ordering CSPs of arity 3 
parameterized above average can be reduced, in polynomial time, to 3 -LINEAR 
Ordering parameterized above average (3 -Linear Ordering- A A) and that 
this reduction preserves the parameter. Thus, to prove that all Ordering CSPs of 
arity 3 parameterized above average are fixed-parameter tractable, it suffices to 
show that 3-LlNEAR ORDERING-AA is fixed-parameter tractable. Gutin et al. 
(271 proved that 3-LlNEAR ORDERING-AA is fixed-parameter tractable and, 
moreover, has a kernel with a quadratic number of constraints and variables. 

Kim and Williams ll39l partially improved the results above by showing 
that 2-Linear Ordering- A A and 3 -Linear Ordering- A A have kernels 
with linear number of variables. Parameterized complexity of Ordering CSPs 
of arities 4 and higher is still unknown. It seems to be technically much more 
difficult to prove that 4-LlNEAR ORDERING-AA is fixed-parameter tractable 
than that 3-LlNEAR ORDERING-AA is fixed-parameter tractable. 

2 Basics on Parameterized Algorithms and Complexity 

A parameterized problem 77 can be considered as a set of pairs (I, k) where 7 
is the problem instance and k (usually a nonnegative integer) is the parameter. 
II is called fixed-parameter tractable (fpt) if membership of (I, k) in 77 can 
be decided by an algorithm of runtime 0(f(k)\I\ c ), where |7| is the size of 
7, f(k) is an arbitrary function of the parameter k only, and c is a constant 
independent from k and 7. Such an algorithm is called an fpt algorithm. Let 77 
and 77' be parameterized problems with parameters k and k', respectively. An 
fpt-reduction Rfrom U to 77' is a many-to-one transformation from 77 to 77', 
such that (i) (7, k) G 77 if and only if (7', k') € 77' with k' < g(k) for a fixed 
computable function g, and (ii) 7? is of complexity 0(/(A:)|7| c ). 

If the nonparameterized version of 77 (where k is just part of the input) 
is NP-hard, then the function f(k) must be superpolynomial provided P^NP 
Often f(k) is "moderately exponential," which makes the problem practically 
feasible for small values of k. Thus, it is important to parameterize a problem in 
such a way that the instances with small values of k are of real interest. 

When the decision time is replaced by the much more powerful |/|°(/( fe )), 
we obtain the class XP, where each problem is polynomial-time solvable for 
any fixed value of k. There is an infinite number of parameterized complexity 
classes between FPT and XP (for each integer t > 1, there is a class W[i]) and 
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they form the following tower: 

FPT C W[l] C W[2] C • • • C W[P] C XP. 

Here W[P] is the class of all parameterized problems (x, Ar) that can be decided 
in f(k)\x\°^ time by a nondeterministic Turing machine that makes at most 
f(k) log |7| nondeterministic steps for some function /. For the definition of 
classes W[t], see, e.g., IT221 (we do not use these classes in the rest of the paper). 

77 is in para-NP if membership of (7, k) in PI can be decided in nonde- 
terministic time 0(/(A;)|7| c ), where |7| is the size of 7, f(k) is an arbitrary 
function of the parameter k only, and c is a constant independent from k and 
7. Here, nondeterministic time means that we can use nondeterministic Turing 
machine. A parameterized problem PI' is para-NF '-complete if it is in para-NP 
and for any parameterized problem PL in para-NP there is an fpt-reduction from 

n to n'. 

While several fpt algorithms were designed many years ago (e.g., pseudo- 
polynomial algorithms with parameter being the binary length of the maximum 
number, cf. [23]), Downey and Fellows were the first to systematically study 
the theory of parameterized algorithms and complexity and they wrote the first 
monograph lTT8l in the area^. 

Given a pair PL, TV of parameterized problems, a bikernelization from TI to 
TI' is a polynomial-time algorithm that maps an instance (7, k) to an instance 
(V,k') (the bikernel) such that (i) (7, k) G 77 if and only if (V, k') G 77', 
(ii) k! < f(k), and (hi) |7'| < g[k) for some functions / and g. The function 
g(k) is called the size of the bikernel. A kernelization of a parameterized prob- 
lem PI is simply a bikernelization from 77 to itself. Then (7', k 1 ) is a kernel. 
The term bikernel was coined by Alon et al. 0; in (H a bikernel is called a 
generalized kernel. 

It is well-known that a parameterized problem 77 is fixed-parameter tractable 
if and only if it is decidable and admits a kernelization 11181221481 . This result 
can be extended as follows: A decidable parameterized problem 77 is fixed- 
parameter tractable if and only if it admits a bikernelization from itself to a 
decidable parameterized problem 77' 0. 

Due to applications, low degree polynomial size kernels are of main inter- 
est. Unfortunately, many fixed-parameter tractable problems do not have kernels 
of polynomial size unless the polynomial hierarchy collapses to the third level 
M5I6I20I1 . For further background and terminology on parameterized complexity 
we refer the reader to the monographs 111 812214811 . 

2 Michael R. Fellows has worked tirelessly for many years to promote the area and so can be 
affectionately called St. Paul of Parameterized Complexity. 
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The following lemma of Alon et al. [2] inspired by a lemma from [ 6 ] shows 
that polynomial bikernels imply polynomial kernels. 

Lemma 1. Let IT, II' be a pair of decidable parameterized problems such that 
the nonparameterized version of II' is in NP, and the nonparameterized version 
of II is NP-complete. If there is a bikernelization from LJ to II' producing a 
bikernel of polynomial size, then LJ has a polynomial-size kernel. 

Henceforth [n] stands for the set {1, 2, . . . , n}. 
3 Probabilistic and Harmonic Analysis Tools 

We start this section by outlining the very basic principles of the probabilistic 
method which will be implicitly used in this paper. Given random variables 
Xi, . . . , X n , the fundamental property known as linearity of expectation states 
that E(Xl + . . . + X n ) = E(Xl) + . . . + K(X n ). The averaging argument 
utilizes the fact that there is a point for which X > E(X) and a point for which 
X < K(X) in the probability space. Also a positive probability F(A) > for 
some event A means that there is at least one point in the probability space 
which belongs to A. For example, F(X > k) > tells us that there exists a 
point for which X > k. 

A random variable is discrete if its distribution function has a finite or count- 
able number of positive increases. A random variable X is symmetric if —X has 
the same distribution function as X. If X is discrete, then X is symmetric if and 
only if F(X = a) = F(X = —a) for each real a. Let X be a symmetric variable 
for which the first moment E(X) exists. Then E(X) = E(-X) = -E(X) and, 
thus, E(X) = 0. The following is easy to prove [30]. 

Lemma 2. If X is a symmetric random variable and E(X 2 ) is finite, then 

F(X> tJeix 2 ) ) > 0. 

If X is not symmetric then the following lemma can be used instead (a 
similar result was already proved in |[3l). 

Lemma 3 (Alon et al. [2 |). Let X be a real random variable and suppose that 
its first, second and fourth moments satisfy E[X] = 0, E[X 2 ] = a 2 > and 
Epf 4 ] < cE[X 2 ] 2 , respectively, for some constant c. Then F(X > ^) > 0. 

To check E[X 4 ] < cE[X 2 ] 2 we often can use the following well-known in- 
equality. 
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Lemma 4 (Hypercontractive Inequality |7|). Let f = f(x\, . . . ,x n ) be a 
polynomial of degree r in n variables x\, . . . ,x n each with domain { — 1,1}. 
Define a random variable X by choosing a vector (ej, . . . , e n ) G { — 1, l} n 
uniformly at random and setting X = f(e%, . . . ,e n ). Then E[X 4 } < 9 r E[X 2 ] 2 . 

If / = f{x\, ■ ■ ■ ,x n ) is a polynomial in n variables xi, . . . , x n each with 
domain {—1, 1}, then it can be written as / = Ylic[n] °i ILes x *' wnere l n ] = 
{1, . . . , n} and cj is a real for each I C [n]. 

The following dual, in a sense, form of the Hypercontractive Inequality was 
proved by Gutin and Yeo lOTTl : for a weaker result, see ll30lL 

Lemma 5. Let f = f(x±, . . . , x n ) be a polynomial in 7i variables x^, . . . , x n 
each with domain { — 1, 1} such that f = 5Zjc[n] Cl TlieS x i- Suppose that no 
variable Xi appears in more than p monomials of f. Define a random variable X 
by choosing a vector (ei, . . . , e n ) £ { — 1, l} n uniformly at random and setting 
X = /(ei, . . . , en). Then E[X 4 ] < (2p + 1)E[X 2 } 2 . 

The following lemma is easy to prove, cf. ll30l . In fact, the equality there is 
a special case of Parseval's Identity in Harmonic Analysis, cf. [49 ]. 

Lemma 6. Let f = f(x±, . . . , x n ) be a polynomial in tl variables x\, . . . , x n 
each with domain { — 1, 1} such that f = YlicWil c l T\iel Xi ~ Define a random 
variable X by choosing a vector (ei, . . . , e n ) G { — 1, l} n uniformly at random 
and setting X = f(e\, . . . , e n ). Then E[X 2 ] = J2i<=i c r 

4 Parameterizations of MaxSat 

In the well-known problem MaxSat, we are given a CNF formula F with m 
clauses and asked to determine the maximum number of clauses of F that can 
be satisfied by a truth assignment. Let us assign True to each variable of F with 
probability 1/2 and observe that the probability of a clause to be satisfied is at 
least 1/2 and thus, by linearity of expectation, the expected number of satisfied 
clauses in F is at least m/2. Thus, by the averaging argument, there exists a 
truth assignment to the variables of F which satisfies at least m/2 clauses of F. 

Let us denote by sat(F) the maximum number of clauses of F that can be 
satisfied by a truth assignment. The lower bound sat(F) > m/2 is tight as we 
have sat(il) = m/2 if H = (x\) A (xi) A • • • A (x m / 2 ) A (x m / 2 )- Consider the 
following parameterization of MaxSat above tight lower bound introduced by 
Mahajan and Raman ll44ll . 

MAxSAT-A(m/2) 

Instance: A CNF formula F with m clauses (clauses may appear several 
times in F) and a nonnegative integer k. 
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Parameter: k. 

Question: sat(-F) > m/2 + kl 

Mahajan and Raman [44J proved that MaxSat-A(to/2) admits a kernel 
with at most 6k + 3 variables and 10k clauses. Crowston et al. lfl6l improved 
this result, by obtaining a kernel with at most 4k variables and (2\/5 + 4)k 
clauses. The improved result is a simple corollary of a new lower bound on 
sat(F) obtained in lfT6l . which is significantly stronger than the simple bound 
sat(F) > m/2. We give the new lower bound below, in Theorem[3] 

For a variable x in F, let m(x) denote the number of pairs of unit of clauses 
(x), (x) that have to be deleted from F such that F has no pair (x), (x) any 
longer. Let vax(F) be the set of all variables in F and let rh = Ylxevax(F) m ( x )- 
The following is a stronger lower bound on sat(F) than m/2. 

Theorem 1. For a CNF formula F, we have sat(F) > m/2+<J)(m — m), where 
$ = (y/5 - l)/2 rj 0.618. 

A CNF formula F is t-satisfiable if for any t clauses in F, there is a truth 
assignment which satisfies all of them. It is easy to check that F is 2-satisfiable 
if and only if rh = and clearly Theorem Q] is equivalent to the assertion that if 
F is 2-satisfiable then sat (F) > ifim. The proof of this assertion by Lieberherr 
and Specker [41] is quite long; Yannakakis [56] gave the following short proba- 
bilistic proof. For x G var(F), let the probability of x being assigned True be 
4> if (x) is in F, 1 — <p if (x) is in F, and 1/2, otherwise, independently of the 
other variables. Let us bound the probability p(C) of a clause C to be satisfied. 
If C contains only one literal, then, by the assignment above, p(C) = <j>. If C 
contains two literals, then, without loss of generality, C = (x V y). Observe that 
the probability of x assigned FALSE is at most <ft (it is if> if (x) is in F). Thus, 
p{C) > 1 — <p 2 - It remains to observe that 1 — <p 2 = <f>. Now to obtain the bound 
sat(F) > <j)m apply linearity of expectation and the averaging argument. 

Note that <j)m is an asymptotically tight lower bound: for each e > there 
are 2-satisfiable CNF formulae F with sat(F) < m(4> + e) iHTTl . Thus, the 
following problem stated by Mahajan and Raman [44] is natural. 

MAX-2S-SAT-A(0m) 

Instance: A 2-satisfiable CNF formula F with m clauses (clauses may 
appear several times in F) and a nonnegative integer k. 
Parameter: k. 

Question: sat(i ? ) > <pm + fc? 

Mahajan and Raman [44] conjectured that MAX-2S-SAT-A((^m) is fpt. 
Crowston et al. lfl6l solved this conjecture in the affirmative; moreover, they 
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obtained akernel with at most (7+3\ /r 5)k variables. This result is an easy corol- 
lary from a lower bound on sat(F) given in Theorem[3l which, for 2-satisfiable 
CNF formulas, is stronger than the one in Theorem Q] The main idea of |[T6l is 
to obtain a lower bound on sat(F) that includes the number of variables as a 
factor. It is clear that for general CNF formula F such a bound is impossible. 
For consider a formula containing a single clause c containing a large number 
of variables. We can arbitrarily increase the number of variables in the formula, 
and the maximum number of satisfiable clauses will always be 1 . We therefore 
need a reduction rule that cuts out 'excess' variables. Our reduction rule is based 
on the notion of an expanding formula given below. Lemma [7] and Theorem |2] 
show the usefulness of this notion. 

A CNF formula F is called expanding if for each X C var(F), the number 
of clauses containing at least one variable from X is at least \X\ 1121 1551 . The 
following lemma and its parts were proved by many authors, see, e.g., Fleis- 
chner, Kullmann and Szeider fT\\ , Lokshtanov ll43l and Szeider ll55l . 

Lemma 7. Let F be a CNF formula and let V and C be its sets of variables 
and clauses. There exists a subset C* C C that can be found in polynomial 
time, such that the formula F' with clauses C\C* and variables V \ V*, where 
V* = var(C*), is expanding. Moreover, sat(F) = sat(-F') + \C*\. 

The following result was shown by Crowston et al. lfl6l . The proof is non- 
trivial and consists of a deterministic algorithm for finding the corresponding 
truth assignment and a detailed combinatorial analysis of the algorithm. 

Theorem 2. Let F be an expending 2-satisfiable CNF formula with n variables 
and m clauses. Then sat(F) > 4>m + n(2 — 3(/>)/2. 

Lemma [7] and Theorem |2] imply the following: 

Theorem 3. Let F be a 2-satisfiable CNF formula and let V and C be its sets 
of variables and clauses. There exists a subset C* C C that can be found in 
polynomial time, such that the formula F' with clauses C \C* and variables 
V \ V* , where V* = var(C*), is expanding. Moreover, we have 

sat(F) > 4>m + (1 - 4>)m* + (n - n*)(2 - 3<£)/2, 

where m = \C\, m* = \C*\, n = \V\ and n* = \V*\. 

Let us turn now to 3-satisfiable CNF formulas. If F is 3-satisfiable then 
it is not hard to check that the forbidden sets of clauses are pairs of the form 
{x}, {x} and triplets of the form {x}, {y}, {x, y} or {x}, {x, y}, {x, y}, as well 
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as any triplets that can be derived from these by switching positive literals with 
negative literals. 

Lieberherr and Specker B2l and, later, Yannakakis ||56l proved the follow- 
ing: if F is 3-satisfiable then sat(F) > ^w(C(F)). This bound is also asymp- 
totically tight. Yannakakis [56] gave a probabilistic proof which is similar to 
his proof for 2-satisfiable formulas, but requires consideration of several cases 
and, thus, not as short as for 2-satisfiable formulas. For details of his proof, see, 
e.g., Gutin, Jones and Yeo [28] and Jukna [36] (Theorem 20.6). Yannakakis's 
approach was extended by Gutin, Jones and Yeo IT281 to prove the following 
theorem using a quite complicated probabilistic distribution for a random truth 
assignment. 

Theorem 4. Let F be an expanding 3-satisfiable CNF formula with n variables 
and m clauses. Then s&t(F) > |m + pn, where p(> 0.0019) is a constant. 

This theorem and Lemma [7] imply the following: 

Theorem 5. Let F be a 3-satisfiable CNF formula and let V and C be its sets 
of variables and clauses. There exists a subset C* C C that can be found in 
polynomial time, such that the formula F' with clauses C \C* and variables 
V \ V*, where V* = var(C*), is expanding. Moreover, we have 

2 1 

sat(i ? ) > -m H — m* + p(n — n*), 

3 3 

where p(> 0.0019) is a constant, m = \C\, m* = \C*\, n = \V\ and n* = 
\V*\. 

Using this theorem it is easy to obtain a linear-in-number-of- variables kernel 
for the following natural analog of MAX-2S-SAT-A(0m), see [28 ] for details. 

MAX-3S-SAT-A(|m) 

Instance: A 3-satisfiable CNF formula F with m clauses and a nonneg- 
ative integer k. 
Parameter: k. 

Question: s&t(F) > |m + kl 

Now let us consider the following important parameterization of r-SAT be- 
low the tight upper bound m: 

r-SAT-B(m) 

Instance: An r-CNF formula F with m clauses (every clause has at most 
r literals) and a nonnegative integer k. 
Parameter: k. 

Question: sat(i ? ) > m — kl 
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Since MAX-r-SAT is NP-hard for each fixed r > 3, r-SAT-B(m) is not fpt un- 
less P=NP. However, the situation changes for r = 2: Razgon and O'Sullivan 
|[52l proved that 2-SAT-B(m) is fpt. The algorithm in 11521 is of complexity 
0(15 k km 3 ) and, thus, MAX-2-SAT-B(m) admits a kernel with at most 15 k k 
clauses. It is not known whether 2-S AT-B(m) admits a kernel with a polynomial 
number of variables. Raman et al. II5T1 and Cygan et al. ifTTl designed algorithms 
for 2-SAT-B(m) of runtime 9 k (km)°^ and A k {km) 0<yl \ respectively. In both 
papers, the authors consider the following parameterized problem (VC-AMM): 
given a graph G whose maximum matching is of cardinality fj,, decide whether 
G has a vertex cover with at most \i + k vertices, where k is the parameter. A pa- 
rameterized algorithm of the above-mentioned complexity actually is obtained 
for VC-AMM, and 2-SAT-B(m) is polynomially transformed into VC-AMM 
(the transformation is parameter-preserving). While Raman et al. ifSTTl obtain the 
parameterized algorithm for VC-AMM directly, Cygan et al. IfTTl derive it via 
a reduction from a more general problem on graphs parameterized above a tight 
bound. 

2-Sat-B(to) has several application. 2-SAT-B(m) is, in fact, equivalent to 
VC-AMM H46I51I171 . Mishra et al. g6l studied the following problem: given 
a graph G, decide whether by deleting at most k vertices we can make G Kbnig, 
i.e., a graph in which the minimum size of a vertex cover equals the maxi- 
mum number of edges in a matching. They showed how to reduce the last prob- 
lem to VC-AMM. It is noted by Gottlob and Szeider [24] that fixed-parameter 
tractability of VC-AMM implies the fixed-parameter tractability of the follow- 
ing problem. Given a CNF formula F (not necessarily 2-CNF), decide whether 
there exists a subset of at most k variables of F so that after removing all oc- 
currences of these variables from the clauses of F, the resulting CNF formula is 
Renamable Horn, i.e., it can be transformed by renaming of the variables into a 
CNF formula with at most one positive literal in each clause. 

2-SAT-B(m) has also been used in order to obtain the best known bound 
on the order of a kernel for Vertex Cover (given a graph G and an integer 
k, decide whether G has a vertex cover with at most k vertices). The fact that 
Vertex Cover has a kernel with at most 2k vertices was known for a long 
time, see Chen, Kanj and Jia (3. This was improved to 2 k — 1 by Chlebfk and 
Clebfkova ifTOll and further to 2 k — c for any constant c by Soleimanfallah and 
Yeo [54]. Lampis [40] used the same approach as in [54], but instead of reduc- 
ing an instance of Vertex Cover to a large number of 2-SAT instances, he 
reduced Vertex Cover to 2-SAT-B(m) via VC-AMM. As a result, Lampis 
[40] obtained a kernel of order at most 2k — clog k for any constant c. We will 
now briefly describe how this kernel was obtained. 
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For a graph G let /3(G) denotes the minimum size of a vertex cover of G and 
/i(G) the maximum size of a matching in G. In their classical work Nemhauser 
and Trotter PTTTl proved the following: 

Theorem 6. There is an 0{\E\^J\V\)-time algorithm which for a given graph 
G = (V, E) computes two disjoint subsets of vertices of G, V, V", such that 
/3(G) = P(G[V']) + \V"\ andp{G[V'\) > \V'\/2. 

Soleimanfallah and Yeo ll54l showed the following additional inequality: 

P(G[V]) > \V'\ - KG). (1) 

Let k' :=k- \V"\. By TheoremU /3(G) < k if and only if /3(G[V']) < k'. 
If \V'\ < 2k' — clog k' < 2k — clog k then we have a kernel and we are done. 
Thus, it suffices to show that if \V'\ > 2k' — clogk' we can decide whether 
f3(G[V']) < k' in polynomial time. We assume that \ V'\ > 2k' — clogk' and 
we may also assume that \ V'\ < 2k' as otherwise {3(G[V']) > k' by Theorem [6] 
By ©if KG[V']) < (\V'\ - c log £0/2 then /3(G[T/']) > (\V'\ + clog k')/2. 
Since \V'\ > 2k' - clogk' this means that f3(G[V']) > k'. 

So, consider the case fi(G[V'}) > (\V'\ - clog k')/2. Since \V'\ > 2k' - 
clog k' and fi(G[V'}) > (\V'\ -clogk')/2, we have p,(G[V']) > k' -clogk' 
and so k' < fi(G[V']) + clog k'. Thus, to decide whether (3(G[V']) < k' it suf- 
fices to compute I such that /3(G[V]) = p,(G[V']) + 1, where £ < clog k', and 
to compare fj,(G[V']) +1 with k' . Using an fpt algorithm for VC-AMM (which 
is essentially an fpt algorithm for Max-2-Sat-B(?ti) as the two problems are 
equivalent) we can compute I in fpt time (provided we use an efficient algorithm 
such as in H52I51I17II ). 

5 Boolean Max-r-CSPs Above Average 

Throughout this section, r is a positive integral constant. Recall that the problem 
MAX-r-CSP-AA is given by a set V of n variables and a set of m Boolean 
formulas; each formula is assigned an integral positive weight and contains at 
most r variables from V. Averaging over all truth assignments, we can find the 
average value A of the weight of satisfied formulas. We wish to decide whether 
there exists a truth assignment that satisfies formulas of total weight at least 
A + k, where k is the parameter (k is a nonnegative integer). 

Recall that the problem MAX-r-LlN2-AA is a special case of MAX-r- 
CSP-AA when every formula is a linear equation over F2 with at most r vari- 
ables and that MAX-LIN2-AA is the extension of MAX-r-LlN2-AA when we 
do not bound the number of variables in an equation. Research of both MAX-r- 
LIN2-AA and Max-Lin2-AA led to a number of basic notions and results of 
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interest for both problems, and we devote Subsection 15. II to these notions and 
results. In particular, we will show that A = W/2, where W is the total weight 
of all equations, introduce a Gaussian-elimination-type algorithm for both prob- 
lems, and a notion and simple lemma of a sum-free subset of a set of vectors 
in F!}. This lemma is a key ingredient in proving some important results for 
MAX-r-LlN2-AA and Max-Lin2-AA. 

MAX-r-LlN2-AA is studied in Subsection |5.21 where we give proof schemes 
of a result by Gutin et al. (30| that MAX-r-LlN2-AA has a kernel of quadratic 
size and a result of Crowston, Fellows et al. |12] that MAX-r-LlN2-AA has a 
kernel with at most (2k — l)r variables. The latest result improves that of Kim 
and Williams [39] that MAX-r-LlN2-AA has a kernel with at most r(r + l)k 
variables. 

In Subsection l5.3l we give a proof scheme of a result by Alon et al. (2| that 
MAX-r-CSP-AA has a a kernel of polynomial size. The main idea of the proof 
is to reduce MAX-r-CSP-AA to MAX-r-LlN2-AA and use the above results 
on MAX-r-LlN2-AA and LemmaQ] This shows the existence of a polynomial- 
size kernel, but does not allow us to obtain a bound on the degree of the poly- 
nomial. Nevertheless, this solves an open question of Mahajan, Raman and Sik- 
dar I43l not only for MAX-r-SAT-AA but also for the more general problem 
MAX-r-CSP-AA. Recall that the problem MAX-r-SAT-AA is a special case 
of MAX-r-CSP-AA when every formula is a clause with at most r variables. 
For MAX-r-SAT-AA, the reduction to MAX-r-LlN2-AA can be complemented 
by a reduction from MAX-r-LlN2-AA back to MAX-r-SAT-AA, which yields 
a kernel of quadratic size. 

5.1 Basic Results for Max-Lin2-AA and Max-r-Lin2-AA 

Recall that in the problems MaxLin2-AA and MAX-r-LlN2- AA, we are given 
a system S consisting of m linear equations in n variables over F2 in which each 
equation is assigned a positive integral weight. In MAX-r-LlN2-AA, we have 
an extra constraint that every equation has at most r variables. Let us write the 
system S as ^2 ieI z% = bj, I G F, and let wj denote the weight of an equation 
Y^iei z i = bi- Clearly, m = (J 7 ]. Let W = ^/ e jr^/ and let sat (5) be the 
maximum total weight of equations that can be satisfied simultaneously. 

For each i 6 [n], set z% = 1 with probability 1/2 independently of the rest 
of the variables. Then each equation is satisfied with probability 1/2 and the 
expected weight of satisfied equations is W/2 (as our probability distribution is 
uniform, W/2 is also the average weight of satisfied equations). Hence W/2 is 
a lower bound; to see its tightness consider a system of pairs of equations of the 
form J2iei z i = 0) Yli&i «t = 1 of weight 1. The aim in both MAX-LIN2-AA 
and MAX-r-LlN2-AA is to decide whether for the given system S, sat(S) > 



14 



W/2+k, where k is the parameter. It is well-known that, in polynomial time, we 
can find an assignment to the variables that satisfies equations of total weight 
at least W/2, but, for any e > it is NP-hard to decide whether there is an 
assignment satisfying equations of total weight at least W(l + e)/2 (33). 

Henceforth, it will often be convenient for us to consider linear equations in 
their multiplicative form, i.e., instead of an equation J2iei Zi = with Zi G 
{0,1}, we will consider the equation Yliei x i = ( — -Q 6/ with x i ^ { — 1>1}- 
Clearly, an assignment z° = (z^, . . . , z®) satisfies Yliei z i = hi ^ an( ^ OIU y 
if the assignment x° = (x?, . . . satisfies niei" x * = ( — l) 6l > where x® = 
(— l) z i for each i G [n]. 

Let e(x) = ^2i £ jrWi(—l) bl Yl ieI Xi (each G {—1,1}) and note that 
e(x°) is the difference between the total weight of satisfied and falsified equa- 
tions when Xi = x® for each i £ [n]. Crowston et al. Ifl4l call e{x) the excess 
and the maximum possible value of e{x) the maximum excess. 

Remark 1. Observe that the answer to Max-Lin2-AA and MAX-r-LlN2-AA 
is Yes if and only if the maximum excess is at least 2k. 

Let A be the matrix over F2 corresponding to the set of equations in S, such 
that ciji = 1 if i G Ij and 0, otherwise. 

Consider two reduction rules for Max-Lin2-AA introduced by Gutin et al. 
QUI . Ruled] was studied before in P4l . 

Reduction Rule 1 If we have, for a subset I of [n], an equation Yli^i x i = 
b'j with weight w'j, and an equation Yl ieI X{ = b'j with weight w", then we 
replace this pair by one of these equations with weight w'j + w'i if b'j = b'j and, 
otherwise, by the equation whose weight is bigger, modifying its new weight to 
be the difference of the two old ones. If the resulting weight is 0, we delete the 
equation from the system. 

Reduction Rule 2 Let t = rankj4 and suppose columns a 11 , . . . , a H of A are 
linearly independent. Then delete all variables not in {x^, . . . ,Xi t } from the 
equations of S. 

Lemma 8. H30\l Let S' be obtained from S by Rule\J}or^\ Then the maximum 
excess of S' is equal to the maximum excess of S. Moreover, S' can be obtained 
from S in time polynomial in n and m. 

If we cannot change a weighted system S using Rules Q] and |2l we call it 

irreducible. 

Let S be an irreducible system of Max-Lin2-AA. Consider the following 
algorithm introduced in lfl4l . We assume that, in the beginning, no equation or 
variable in S is marked. 
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Algorithm % 

While the system S is nonempty do the following: 

1. Choose an equation Yl ieI %i = b and mark a variable x\ such that I G /. 

2. Mark this equation and delete it from the system. 

3. Replace every equation flier Xi = ^ ^ n tne s y stem containing xi by 
Hie/AT' x i = bb', where IAI' is the symmetric difference of / and I' (the 
weight of the equation is unchanged). 

4. Apply Reduction Rule[T]to the system. 

The maximum H-excess of S is the maximum possible total weight of equa- 
tions marked by % for S taken over all possible choices in Step 1 of %. The 
following lemma indicates the potential power of H. 

Lemma 9. 1/41/ Let S be an irreducible system. Then the maximum excess of S 
equals its maximum Ti-excess. 

This lemma gives no indication on how to choose equations in Step 1 of 
Algorithm %. As the problem MAX-LIN2-AA is NP-hard, we cannot hope to 
obtain an polynomial-time procedure for optimal choice of equations in Step 
1 and, thus, have to settle for a good heuristic. For the heuristic we need the 
following notion first used in lfl4l . Let K and M be sets of vectors in F?> such 
that K C M. We say K is M -sum-free if no sum of two or more distinct vectors 
in K is equal to a vector in M. Observe that K is M-sum-free if and only if K 
is linearly independent and no sum of vectors in K is equal to a vector in M\K. 

The following lemma was proved implicitly in fBl and, thus, we provide a 
short proof of this result. 

Lemma 10. Let S be an irreducible system of MAX-LIN2-AA and let A be 
the matrix corresponding to S. Let M be the set of rows of A (viewed as vectors 
in F2 ) and let K be an M-sum-free set of k vectors. Let w m i Q be the minimum 
weight of an equation in S. Then, in time in (nm)°^\ we can find an assign- 
ment to the variables of S that achieves excess of at least w m i n ■ k. 

Proof. Let {ej 1 , . . . , 6j k } be the set of equations corresponding to the vectors in 
K. Run Algorithm Ti, choosing at Step 1 an equation of S from {ej 1 , . . . , e Jfc } 
each time, and let S' be the resulting system. Algorithm % will run for k itera- 
tions of the while loop as no equation from {e^ , . . . , ej k } will be deleted before 
it has been marked. 

Indeed, suppose that this is not true. Then for some and some other equa- 
tion e in S, after applying Algorithm % for at most / — 1 iterations e J; and 
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e contain the same variables. Thus, there are vectors Vj £ K and v 6 M 
and a pair of nonintersecting subsets iv"' and if" of K \ {v,Vj} such that 
v j + J2ueK> u = v + YsuaK" u - Tnus > u = v j + J2ueK'uK" u > a contradiction 



5.2 Max-r-Lin2-AA 

The following result was proved by Gutin et al. QUI . 

Theorem 7. The problem MAX-r-LlN2-AA admits a kernel with at most 0(k 2 ) 
variables and equations. 

Proof. Let the system S be irreducible. Consider the excess 



Let us assign value —1 or 1 to each xi with probability 1/2 independently of the 
other variables. Then X = e(x) becomes a random variable. By Lemma|6l we 
have K(X 2 ) = X^/ej-^/- Therefore, by Lemmas [3] and @J 



Hence by RemarkQ] if y/m/(2-3 r ) > 2k, then the answer to MAX-r-LlN2-AA 
is Yes. Otherwise, m = 0{k 2 ) and, by Rule|2j we have n < m = 0(k 2 ). □ 

The bound on the number of variables can be improved and it was done by 
Crowston et al. lfl4l and Kim and Williams [39]. The best known improvement 
is by Crowston, Fellows et al. lfl2l : 

Theorem 8. The problem MAX-r-LlN2-AA admits a kernel with at most {2k— 
l)r variables. 

This theorem can be easily proved using Formula ©, Lemma [lOl and the 
following result by Crowston, Fellows et al. lfl2l . 

Lemma 11. Let M be a set of vectors in FJ? such that M contains a basis of 
F2 . Suppose that each vector of M contains at most r non-zero coordinates. If 
k > 1 is an integer and n > r{k — 1) + 1, then in time \M\°( 1 \ we can find a 
subset KofMofk vectors such that K is M -sum-free. 

Both Theorem [8] and a slightly weaker analogous result of ||39l imply the 
following: 



to the definition of K. 



□ 




(2) 



F[X > y/m/(2 ■ 3 r ) ] > P X> 




17 



Corollary 1. There is an algorithm of runtime 2°^ +m°W for MAX-r-LlN2- 
AA. 

Kim and Williams [39 ] proved that the last result is best possible, in a sense, 
if the Exponential Time Hypothesis holds. 

Theorem 9. [39] If Max-3-Lin2-AA can be solved in 0{2 tk 2 em ) time for 
every e > 0, then 3-SAT can be solved in 0{2 &n ) time for every 5 > 0, where n 
is the number of variables. 

5.3 Max-r-CSPs AA 

Consider first a detailed formulation of MAX-r-CSP-AA. Letl/ = {v\, . . . ,v n } 
be a set of variables, each taking values —1 (True) and 1 (False). We are given 
a set of Boolean functions, each involving at most r variables, and a collec- 
tion F of m Boolean functions, each / G F being a member of <P, each with a 
positive integral weight and each acting on some subset of V. We are to decide 
whether there is a truth assignment to the n variables such that the total weight 
of satisfied functions is at least A + k, where A is the average weight (over all 
truth assignments) of satisfied functions and k is the parameter. 

Note that A is a tight lower bound for the problem, whenever the family 
<P is closed under replacing each variable by its complement, since if we apply 
any Boolean function to all 2 r choices of literals whose underlying variables are 
any fixed set of r variables, then any truth assignment to the variables satisfies 
exactly the same number of these 2 r functions. 

Note that if $ consists of clauses, we get MAX-r-SAT-AA. In MAX-r-SAT- 
AA, A = Y^j=i w j(l ~ 2 _rj ), where Wj and rj are the weight and the number 
of variables of Clause j, respectively. Clearly, A is a tight lower bound for Max- 
r-SAT. 

Following 5 for a Boolean function / of weight w(f) and on r(f) < 
r Boolean variables x^, . . . , we introduce a polynomial hf(x), x = 
(zi, . . . ,x n ) as follows. Let Sf C {—1, l} r W denote the set of all satisfying 
assignments of /. Then 

r(f) 

h f (x) = w(f)2 r ^ Yl IK 1 + - ^ 

(v 1 ,...,v r(f) )£Sf j=i 

Let h(x) = J^feJ 7 hf( x )- It i s eas Y to see ( c f- (21) that the value of h(x) at 
some x° is precisely 2 r (U — A), where U is the total weight of the functions 
satisfied by the truth assignment x°. Thus, the answer to MAX-r-CSP-AA is 
Yes if and only if there is a truth assignment x° such that h(x ) > k2 r . 
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Algebraic simplification of h(x) will lead us the following (Fourier expan- 
sion of h{x), cf. |49l ): 



where T = {0 / S C [n] : c 5 7^ 0,|S| < r}. Thus, (J 7 ) < n r . The 
sum 2~2se T c s ILeS x » can ^ e y i ewe d as the excess of an instance of MAX-r- 
LIN2-AA and, thus, we can reduce MAX-r-CSP-AA into MAX-r-LlN2-AA 
in polynomial time (since r is fixed, the algebraic simplification can be done in 
polynomial time and it does not matter whether the parameter of MAX-r-LlN2- 
AA is k or k' = k2 r ). By Theorem [T8l MAX-r-LlN2-AA has a kernel with 
0(k 2 ) variables and equations. This kernel is a bikernel from MAX-r-CSP- 
AA to MAX-r-LlN2-AA. Thus, by LemmaQ] we obtain the following theorem 
of Alon etal. (2). 

Theorem 10. MAX-r-CSP-AA admits a polynomial-size kernel. 

Applying a reduction from MAX-r-LlN2-AA to MAX-r-SAT-AA in which 
each monomial in © is replaced by 2 r_1 clauses, Alon et al. [2 J obtained the 
following: 

Theorem 11. MAX-r-SAT-AA admits a kernel with 0{k 2 ) clauses and vari- 
ables. 

Using also Theorem [U it is easy to improve this theorem with respect to 
the number of variables in the kernel. This result was first obtained by Kim and 
Williams 1591 

Theorem 12. MAX-r-SAT-AA admits a kernel with 0(k) variables. 
6 MaxLin2-AA and MaxSat-AA 

Recall that MaxLin2-AA is the same problem as MAX-r-LlN2-AA, but the 
number of variables in an equation is not bounded. Thus, MaxLin2-AA is a 
generalization of MAX-r-LlN2-AA. In this section we present a scheme of a 
recent proof by Crowston, Fellows et al. [12J that MaxLin2-AA is fpt and has 
a kernel with polynomial number of variables. This result finally solved an open 
question of Mahajan, Raman and Sikdar P31 . Still, we do not know whether 
MaxLin2-AA has a kernel of polynomial size and we are able to give only 
partial results on the topic. 

Theorem 13. [12] The problem MaxLin2-AA has a kernel with at most 0(k 2 log k) 




(3) 



variables. 
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The proof of this theorem in lPT2l which we give later is based on Theorems 
[HandCE] 

Theorem 14. / fi?l/ Let S be an irreducible system of MaxLin2-AA and let 

k > 2. If k < m < 2 n// ^ fc_1 ) — 2, then the maximum excess of S is at least k. 
Moreover, we can find an assignment with excess of at least k in time mP^. 

This theorem can easily be proved using Lemma[lO]and the following lemma. 

Lemma 12. [14] Let M be a set in F^ such that M contains a basis o/Fg, the 
zero vector is in M and \M\ < 2 n . Ifk is a positive integer and k + 1 < \M\ < 
2 n l k then, in time \M\^^ , we can find an M -sum-free subset K of M with at 
least k + 1 vectors. 

Theorem 15. / [72l/ There exists an n 2k (nm) 0( -^ -time algorithm for MaxLin2- 
AA that returns an assignment of excess of at least 2k if one exists, and returns 
No otherwise. 

The proof of this theorem in lfT2l is based on constructing a special depth- 
bounded search tree. 

Now we will present the proof of Theorem [14] from lfl2l . 
Proof of Theorem [I4l Let C be an instance of MaxLin2-AA and let 5 be 
the system of C with m equations and n variables. We may assume that S is 
irreducible. Let the parameter k be an arbitrary positive integer. 

If m < 2k then n < 2k = 0(k 2 log k). If 2k < m < 2 n /( 2fc ~ 1 ) - 2 then, 
by Theorem [14] and Remark [T] the answer to C is YES and the corresponding 
assignment can be found in polynomial time. If m > n 2k then, by Theorem [T5l 
we can solve C in polynomial time. 

Finally we consider the case 2 n /( 2fe_1 ) — 1 < m < n 2k — 1. Hence, n 2k > 
2 n/(2fc-l)_ Therefore, Ak 2 > 2k + n/logn > and n < (2k) 4 . Hence, 
n < Ak 2 \ogn < Ak 2 log(16/c 4 ) = 0(fc 2 logA;). 

Since S is irreducible, m < 2 n and thus we have obtained the desired kernel. 

□ 

Now let us consider some cases where we can prove that MaxLin2-AA 
has a polynomial-size kernel. Consider first the case when each equation in 5 
has odd number of variables. Then we have the following theorem proved by 
Gutin etal. QUI . 

Theorem 16. The special case 0/MAXLIN2-AA when each equation in S has 
odd number of variables, admits a kernel with at most Ak 2 variables and equa- 
tions. 
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Proof. Let the system S be irreducible by RuleQ] Consider the excess e(x) = 

J2ieJ rWl (~^-) bl Tiiei Xi - ^ et us ass ig n value —1 or 1 to each X{ with proba- 
bility 1/2 independently of the other variables. Then e(x) becomes a random 
variable. Since e(— x) = —e(x), e(x) is a symmetric random variable. Let 
X = e(x). By Lemma|6j we have K(X 2 ) = Yliei Therefore, by Lemma[2l 

P( X > ) > P( X > Jf^U w) ) > 0. Hence, if > 2k, the answer 

to MAXLIN2-AA is YES. Otherwise, m < 4k 2 and, after applying Rule|2l we 
have n < m < 4k 2 . □ 

In fact, Gutin et al. [30] proved the following more general result. 

Theorem 17. The following special case of MaxLin2-AA admits a kernel 
with at most 4k 2 variables and equations: there exists a subset U of variables 
such that each equation in Ax = b has odd number of variables from U. 

Let us turn to results on MaxLin2-AA that do not require any parity condi- 
tions. One such result is Theorem |7J Gutin et al. OOl also proved the following 
'dual' theorem. 

Theorem 18. Let p > 1 be a fixed integer. Then MaxLin2-AA restricted to 
instances where no variable appears in more than p equations, admits a kernel 
with 0(k 2 ) variables and equations. 

The proof is similar to that of Theorem |7J but Lemma [5] (in fact, its weaker 
version obtained in l30l ) is used instead of Lemma|4] 

Recall that MaxSat-AA is the same problem as MAX-r-SAT-AA, but the 
number of variables in a clause is not bounded. Crowston et al. lfT5l proved that 
MaxSat-AA is para-NP-complete and, thus, MaxSat-AA is not fpt unless 
P=NP. This is in sharp contrast to MaxLin2-AA. This result is a corollary of 
the following: 

Theorem 19. [15] MAX-r(n)-S AT- AA is para-NP-complete for r(n) = [logn]. 

The Exponential Time Hypothesis (ETH) claims that 3 -SAT cannot be solved 
in time 2°^ , where n is the number of variables (see, e.g., 11221481). Using ETH, 
we can improve Theorem [T9l 

Theorem 20. fcl~5l Assuming ETH. MAX-r(ra)-SAT-AA is not fpt for any r(n) > 
log logn + (ft(n), where 4>(n) is any unbounded strictly increasing function of 
n. 

The following theorem shows that Theorem [20l provides a bound on r(n) 
which is not far from optimal. 
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Theorem 21. [15]MAX-r(n)-SAT-AAisfptforr(n) < loglogn— logloglogn— 
4>(n),for any unbounded strictly increasing function (j)(n). 

7 Ordering CSPs 

In this section we will discuss recent results in the area of Ordering Constraint 
Satisfaction Problems (Ordering CSPs) parameterized above average. Order- 
ing CSPs include several well-known problems such as Betweenness, Cir- 
cular Ordering and Acyclic Subdigraph (which is equivalent to 2- 
Linear Ordering). These three problems have applications in circuit design 
and computational biology 1111501 . in qualitative spatial reasoning ||35ll . and in 
economics ll53ll . respectively. 

Let us define Ordering CSPs of arity 3. The reader can easily generalize it 
to any arity r > 2 and we will do it below for Linear Ordering of arity r. 
Let V be a set of n variables and let 

n C S 3 = {(123), (132), (213), (231), (312), (321)} 

be arbitrary. A constraint set over V is a multiset C of constraints, which are per- 
mutations of three distinct elements of V. A bijection a : V — > [n] is called an 
ordering of V. For an ordering a : V — >■ [n], a constraint (vi, «2, Vs) 6 C is 77- 
satisfied by a if there is a permutation it G 77 such that a(%m) < 0(^(2)) < 
a { v n(3))- Thus, given 77 the problem 77-CSP, is the problem of deciding if there 
exists an ordering of V that 77-satisfies all the constraints. Every such problem 
is called an Ordering CSP of arity 3. We will consider the maximization version 
of these problems, denoted by MAX-77-CSP, parameterized above the average 
number of constraints satisfied by a random ordering of V (which can be shown 
to be a tight bound). 

Guttmann and Maucher ll32l showed that there are in fact only 13 distinct 
77-CSP's of arity 3 up to symmetry, of which 11 are nontrivial. They are listed 
in Tabled] together with their complexity. Note that if 77 = {(123), (321)} then 
we obtain the BETWEENNESS problem and if 77 = {(123)} then we obtain 
3 -Linear Ordering. 

Gutin et al. 11271 proved that all 11 nontrivial MAX-77-CSP problems are 
NP-hard (even though four of the 77-CSP are polynomial). 

Now observe that given a variable set V and a constraint multiset C over 
V, for a random ordering a of V, the probability of a constraint in C be- 
ing 77-satisfied by a equals ^jP-. Hence, the expected number of satisfied con- 
straints from C is ^p|C|, and thus there is an ordering a of V satisfying at least 
^|C| constraints (and this bound is tight). A derandomization argument leads 
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n c s 3 



Name 



Complexity 



n ■ 
n 1 -■ 
n 2 ■- 
n 3 ■- 
n 4 ■- 
n 5 ■- 
n 6 : 
n 7 = 
n a ■ 
n 9 ■- 
n w 



{(123)} 

{(123), (132)} 

{(123), (213), (231)} 

{(132), (231), (312), (321)} 

{(123), (231)} 

{(123), (321)} 

{(123), (132), (231)} 

{(123), (231), (312)} 

S z \ {(123), (231)} 

S 3 \ {(123), (321)} 

= «5 3 \{(123)} 



Linear Ordering- 3 polynomial 



Betweenness NP-comp. 

NP-comp. 

Circular Ordering NP-comp. 

NP-comp. 

Non-Betweenness NP-comp. 

NP-comp. 



polynomial 
polynomial 
polynomial 
NP-comp. 



Table 1. Ordering CSPs of arity 3 (after symmetry considerations) 



to '—^ -approximation algorithms for the problems MAX-77j-CSP [ 8 ]. No better 
constant factor approximation is possible assuming the Unique Games Conjec- 
ture m. 

We will study the parameterization of MAX-77j-CSP above tight lower 
bound: 

77- Above Average (77- AA) 

Input: A finite set V of variables, a multiset C of ordered triples of distinct 

variables from V and an integer k > 0. 
Parameter: k. 

Question: Is there an ordering a of V such that at least ^p|C| + k constraints 
of C are 77-satisfied by at 

In ll27l it is shown that all 11 nontrivial 77-CSP-AA problems admit ker- 
nels with 0{k 2 ) variables. This is shown by first reducing them to 3-LlNEAR 
Ordering- AA (or 2-Linear Ordering-AA), and then finding a kernel for 
this problem, which is transformed back to the original problem. The first trans- 
formation is easy due to the following: 

Proposition 1. / T27l/ Let II be a subset of S3 such that U ^ {0,53}. There 
is a polynomial time transformation f from II -AA to 3-LlNEAR ORDERING- 
AA such that an instance (V,C, k) of II -AA is a YES-instance if and only if 
(V,C , k) = f(V,C, k) is a YES-instance o/3-LlNEAR ORDERING-AA. 
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Proof. From an instance (V, C, k) of il-AA, construct an instance (V, Co, k) of 
3-Linear Ordering- AA as follows. For each triple (v±, v 2 , v 3 ) G C, add |77| 
triples (%(!), 1^(2), %(3)). vr e 77, to C . 

Observe that a triple (^1,^2,^3) G C is 77-satisfied if and only if exactly one 
of the triples (v^m, v^i 2 ) > v w(3))-> 71 G is satisfied by 3 -LINEAR ORDERING. 
Thus, ^|C| + fc constraints from C are 77-satisfied if and only if the same 
number of constraints from Co are satisfied by 3-LlNEAR ORDERING. It remains 
to observe that ^|C| + k= ||C | + fc as |C | = |i7| • |C|. □ 

Recall that the maximization version of r-LlNEAR ORDERING (r > 2) can 
be defined as follows. An instance of such a problem consists of a set of vari- 
ables V and a multiset of constraints, which are ordered r-tuples of distinct 
variables of V (note that the same set of r variables may appear in several dif- 
ferent constraints). The objective is to find an ordering a of V that maximizes 
the number of constraints whose order in a follows that of the constraint (we 
say that these constraints are satisfied). It is well-known that 2-LlNEAR OR- 
DERING is NP-hard (it follows immediately from the fact proved by Karp ll37l 
that the feedback arc set problem is NP-hard). It is easy to extend this hardness 
result to all r-LlNEAR ORDERING problems (for each fixed r > 2). Note that in 
r-LiNEAR Ordering Above Average (r-LiNEAR Ordering-AA), given 
a multiset C of constraints over V we are to decide whether there is an ordering 
of V that satisfies at least \C\/rl + k constraints. 

(2,3)-Linear Ordering is a mixture of 2-Linear Ordering and 3- 
Linear Ordering, where constraints can be of both arity 2 and 3. 

We proceed by first considering 2-LlNEAR ORDERING (Subsection 17. IK 
Betweenness (Subsection 17721). and 3-Linear Ordering (Subsection 17.31) 
separately and proving the existence of a kernel with a quadratic number of vari- 
ables and constraints for their parameterizations above average. We will con- 
clude the section by briefly overviewing the result of Kim and Williams [ 39 ] 
that (2,3)-Linear Ordering has a kernel with a linear number of variables 
(Subsection 17741. By considering (2,3)-LlNEAR ORDERING rather than just 3- 
LlNEAR ORDERING separately, Kim and Williams managed to obtain a finite 
set of reduction rules which appear to be impossible to obtain for 3 -Linear 
Ordering only (see Subsection 17731. 

7.1 2-Linear Ordering 

Let D = (V, A) be a digraph on n vertices with no loops or parallel arcs in 
which every arc ij has a positive integral weight Wij. Consider an ordering 
a : V [n] and the subdigraph D a = (V, {ij 6 A : a(i) < a(j)}) of 
D. Note that D a is acyclic. The problem of finding a subdigraph D a of D 
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of maximum weight is equivalent to 2-LlNEAR ORDERING (where the arcs 
correspond to constraints and weights correspond to the number of occurrences 
of each constraint). 

It is easy to see that, in the language of digraphs, 2-LlNEAR ORDERING- 
AA can be formulated as follows. 

2-Linear Ordering Above Average (2-Linear Ordering-AA) 
Instance: A digraph D = (V, A), each arc ij has an integral positive 
weight w-ij, and a positive integer k. 
Parameter: The integer n. 

Question: Is there a subdigraph D a of D of weight at least W/2 + k, 
where W = }~2ijeA Wij ? 

Mahajan, Raman, and Sikdar g5l asked whether 2-LlNEAR ORDERING- 
AA is fpt for the special case when all arcs are of weight 1. Gutin et al. [30] 
solved the problem by obtaining a quadratic kernel for the problem. In fact, the 
problem can be solved using the following result of Alon [T]: there exists an or- 
dering a such that D a has weight at least + 16 lyi )W. However, the proof in 
ifTTl uses a probabilistic approach for which a derandomization is not known 
yet and, thus, we cannot find the appropriate a deterministically. Moreover, 
the probabilistic approach in HI is quite specialized. Thus, we briefly describe 
a solution from Gutin et al. QUI based on Strictly-Above-Below-Expectation 
Method (introduced in QUI ). 

Consider the following reduction rule: 

Reduction Rule 3 Assume D has a directed 2-cycle iji; if w^ = Wji delete 
the cycle, if w^ > Wji delete the arc ji and replace w^ by w^ — Wji, and if 
Wji > Wij delete the arc ij and replace Wji by Wji — Wij. 

It is easy to check that the answer to 2-Linear Ordering-AA for a digraph D 
is Yes if and only if the answer to 2-Linear Ordering-AA is YES for a digraph 
obtained from D using the reduction rule as long as possible. A digraph is called 
an oriented graph if it has no directed 2-cycle. Note that applying Rule[3]as long 
as possible results in an oriented graph. 

Consider a random ordering: a : V — > [n] and a random variable X(a) = 
lz~2ijeA x ij( a )> wnere x ij((x) = w ij if < a(j) and Xy(a) = -iity, 
otherwise. It is easy to see that X (a) = 2~2{ w ij '■ ij £ A,a(i) < a(j)} — W/2. 
Thus, the answer to 2-Linear Ordering-AA is YES if and only if there is an 
ordering a : V — > [n] such that X(a) > k. Since E(xjj) = 0, we have K(X) = 
0. 

Let = Y,ijeA w ij- Gutin et al - I 30 ! P roved the following: 
Lemma 13. If D is an oriented graph, then E(X 2 ) > /12. 
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Since X{— a) = —X{a), where — a(i) = n + 1 — a(i), X is a symmetric 
random variable and, thus, we use a proof similar to that of Theorem [16] (but 
applying Lemma [13] instead of Lemma [6]) to show the following: 

Theorem 22. [30] 2-Linear Ordering-AA has a kernel with O(kP) arcs. 

By deleting isolated vertices (if any), we can obtain a kernel with 0(k 2 ) arcs 
and vertices. Kim and Williams ||39l proved that 2-LlNEAR ORDERING has a 
kernel with a linear number of variables. 

7.2 Betweenness 

Let V = {vi, . . . , v n } be a set of variables and let C be a multiset of m be- 
tweenness constraints of the form (vi, {vj, i>fc}). For an ordering a : V — > [n], 
a constraint (vi,{vj,Vk}) is satisfied if either a(vj) < a(v{) < a(vk) or 
a(vk) < a(vi) < a(vj). In the BETWEENNESS problem, we are asked to find 
an ordering a satisfying the maximum number of constraints in C. BETWEEN- 
NESS is NP-hard as even the problem of deciding whether all betweenness con- 
straints in C can be satisfied by an ordering a is NP-complete [50]. 

Let a : 7 -> [n] be a random ordering and observe that the probabil- 
ity of a constraint in C to be satisfied is 1/3. Thus, the expected number of 
satisfied constraints is m/3. A triple of betweenness constraints of the form 
(v, {u, w}), (u, {v, w}), (w, {v, u}) is called a complete triple. Instances of BE- 
TWEENNESS consisting of complete triples demonstrate that m/3 is a tight 
lower bound on the maximum number of constraints satisfied by an ordering 
a. Thus, the following parameterization is of interest: 

Betweenness Above Average (Betweenness-AA) 
Instance: A multiset C of m betweenness constraints over variables V 
and an integer k > 0. 
Parameter: The integer k. 

Question: Is there an ordering a : V — > [n] that satisfies at least m/3 +k 
constraints from C? 

In order to simplify instances of BETWEENNESS- A A we introduce the fol- 
lowing reduction rule. 

Reduction Rule 4IfC has a complete triple, delete it from C. Delete from V 
all variables that appear only in the deleted triple. 

Benny Chor's question (see [48 , p. 43]) to determine the parameterized com- 
plexity of Betweenness-AA was solved by Gutin et al. [29] who proved that 
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BETWEENNESS-AA admits a kernel with 0(k 2 ) variables and constraints (in 
fact, [29] considers only the case when C is a set, not a multiset, but the proof 
for the general case is the same GTTD . Below we briefly describe the proof in 

m. 

Suppose we define a random variable X(a) just as we did for 2-LlNEAR 
ORDERING. However such a variable is not symmetric and therefore we would 
need to use Lemma [6] on X(a). The problem is that a is a permutation and in 
Lemma[6]we are looking at polynomials, / = f(x\, X2 ■ ■ ■ , x n ), over variables 
xi,...,x n each with domain { — 1, 1}. In order to get around this problem the 
authors of 11291 considered a different random variable g (Z), which they defined 
as follows. 

Let Z = (z±, Z2, ■ ■ ■ , Z2n) be a set of 2n variables with domain {—1, 1}. 
These 2n variables correspond to n variables z*,^, . . . , z* such that Z2i-i and 
Z2i form the binary representation of z*. That is, z* is 0, 1, 2 or 3 depending on 
the value of (z 2 i-i,z 2 i) G {(-1, -1), (-1, 1), (1, -1), (1, 1)}. An ordering: 
a : V — >■ [n] complies with Z if for every a(i) < a(j) we have z* < z*. We 
now define the value of g(Z) as the average number of constraints satisfied over 
all orderings which comply with Z. Let f(Z) = g{Z) — m/3, and by Lemma[T4l 
we can now use Lemma[6]on f(Z) as it is a polynomial over variables whose do- 
main is {—1, 1}. We consider variables Zi as independent uniformly distributed 
random variables and then f(Z) is also a random variable. In |[29l it is shown 
that the following holds if Reduction Rule [4] has been exhaustively applied. 

Lemma 14. The random variable f(Z) can be expressed as a polynomial of 
degree 6. We have E[/(Z)] = 0. Finally, iff(Z) > k for some Z £ {-1, l} 2n 
then the corresponding instance o/BETWEENNESS-AA is a YES-instance. 

Lemma 15. [27] For an irreducible (by Reduction Rule^j) instance we have 
nf(Z) 2 } > j±m. 

Theorem 23. [27] BETWEENNESS-AA has a kernel of size 0(k 2 ). 

Proof Let (V, C) be an instance of BETWEENNESS- AA. We can obtain an irre- 
ducible instance (V' , C) such that (V, C) is a YES-instance if and only if (V', C) 
is a YES-instance in polynomial time. Let m! = \C'\ and let f(Z) be the random 
variable defined above. Then f(Z) is expressible as a polynomial of degree 6 
by LemmaEl hence it follows from Lemmag]that E[/(Z) 4 ] < 2 36 E[/(Z) 2 ] 2 . 
Consequently, f(Z) satisfies the conditions of Lemma [3l from which we con- 
clude that F (j{Z) > 4 ,2i8 yrek 771,7 ) > 0, by Lemma[15J Therefore, by Lemma 
[141 if 4^tfv/^"i' > « then (V',C) is a YES-instance for BETWEENNESS- 
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AA. Otherwise, we have m! = 0(k 2 ). This concludes the proof of the theorem. 

□ 

By deleting variables not appearing in any constraint, we obtain a kernel 
with 0(k 2 ) constraints and variables. 

7.3 3-Linear Ordering 

In this subsection, we will give a short overview of the proof in [27] that 3- 
Linear Ordering has a kernel with at most 0(k 2 ) variables and constraints. 

Unfortunately, approaches which we used for 2-LlNEAR ORDERING-AA 
and BETWEENNESS-AA do not work for this problem. In fact, if we wanted 
to remove subsets of constraints where only the average number of constraints 
can be satisfied such that after these removals we are guaranteed to have more 
than the average number of constraints satisfied, then, in general case, an infinite 
number of reduction rules would be needed. The proof of this is quite long and 
therefore omitted from this survey, see ETTl for more information. 

However, we can reduce an instance of 3-LlNEAR ORDERING-AA to in- 
stances of Betweenness-AA and 2-Linear Ordering- AA as follows. With 
an instance (V, C) of 3-LlNEAR ORDERING-AA, we associate an instance (V, B) 
of Betweenness-AA and two instances (V,A f ) and (V,A") of 2-Linear 
Ordering-AA such that if C p = (u, v, w) G C, then add B p = (v, {u, w}) to 
B, a' p = (u, v) to A', and a'p = (v, w) to A". 

Let a be an ordering of V and let dev(V, C, a) denote the number of con- 
straints satisfied by a minus the average number of satisfied constraints in (V, C), 
where (V, C) is an instance of 3-Linear Ordering- AA, Betweenness-AA 
or 2-Linear Ordering-AA. 

Lemma 16. [27] Let (V,C,k) be an instance o/3-LlNEAR ORDERING-AA 
and let a be an ordering of V. Then 

dev(y, C,a) = - [dev(F, A', a) + dev(y, A", a) + dev(V, B, a)] . 

Therefore, we want to find an ordering satisfying as many constraints as 
possible from both of our new type of instances (note that we need to use the 
same ordering for all the problems). 

Suppose we have a No-instance of 3-LlNEAR ORDERING- AA. As above, 
we replace it by three instances of BETWEENNESS-AA and 2-LlNEAR ORDERING- 
AA. Now we apply the reduction rules for BETWEENNESS-AA and 2-LlNEAR 
Ordering-AA introduced above as well as the proof techniques described in 
the previous sections in order to show that the total number of variables and 
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constraints left in any of our instances is bounded by 0(k 2 ). We then transform 
these reduced instances back into an instance of 3-LlNEAR ORDERING-AA 
as follows. If {v,{u,w}} is a Betweenness constraint then we add the 3- 
LlNEAR ORDERING-AA constraints (u, v, w) and (w, v, u) and if (u, v) is an 
2-Linear Ordering- AA constraint then we add the 3-Linear Ordering- 
AA constraints (u, v, w), (u, w, v) and (w, u, v) (for any w G V). As a result, 
we obtain a kernel of 3-Linear Ordering-AA with at most 0(k 2 ) variables 
and constraints. 

7.4 (2,3)-Linear Ordering-AA 

In the previous subsection, we overviewed a result that 3 -LINEAR ORDERING- 
AA has a kernel with at most 0(k 2 ) variables and constraints. This result has 
been partially improved by Kim and Williams 1391 who showed that 3-LlNEAR 
Ordering-AA has a kernel with at most O(k) variables. We will now outline 
their approach, where they considered (2,3)-LlNEAR ORDERING- AA. That is, 
we allow constraints to contain 2 or 3 variables. Thus, we can apply the fol- 
lowing reduction rules, where w(e) denotes the weight of constraint e (i.e., the 
number of times e appears in the constraint multiset) and if e = (u, v, w) is 
a constraint then we denote u by e(l), v by e(2) and w by e(3), and var(e) 
denotes the variables in e. 

Redundancy Rule: Remove a variable v from V if it does not appear in any 
constraint. Remove a constraint e from C if its weight is zero. 

Merging Rule: If e\ and e 2 are identical, then replace them by a single con- 
straint of weight w(e\) + w(e 2 ). 

Cancellation Rule: If there are two constraints e\, e2 with \e\\ = 1 62 1 = 2 
and e2 = (ei(2), e\ (1)), let w m i n = min{u;(ei), wfa)} and replace the 
weights by w(e{) = io(ei) - w min and w(e 2 ) = w(e 2 ) - w mhl . 

Edge Replacement Rule: If e%, e^-, &z are three constraints in C with var{e\ ) = 
varied) = var{e^) and such that e 2 = (ei(2),ei(l),ei(3)) and = 
( ei (l),ei(3), ei (2)),then: 

- replace the weight of a constraint by u>(ej) = w(ei) — w m i n for each 
i = 1,2, 3, where w m \ n = min{u;(ei), w{e 2 ), w(e%)}. 

- add the binary ordering constraint (ei(l), ei(3)) of weight w m i n . 

Cycle Replacement Rule: If e\ , e 2 , e% are three constraints in C with var (e\ ) = 

var(e 2 ) = var{e^) and such that e 2 = (ei(2),ei(3),ei(l)) and = 
(ei(3),ei(l),ei(2)),then: 

- replace the weight of a constraint by w(ei) = w(ei) — w m i n for each 
i = 1,2,3, where u; min = mm{w(e\), w(e 2 ), u>(e 3 )}. 
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- add the three binary ordering constraints (ei(l),ei(2)), (ei(2), ei(3)) 
and (ei(3), ei(l)), each of weight w mm . 



In 11391 it is shown that these reduction rules produce equivalent instances. 
In |[39l the following theorem is then proved. 

Theorem 24. H39V Let I = (V, C, k) be an irreducible (under the above reduc- 
tion rules) instance of (2,3)-LlNEAR ORDERING- AA. If I is a No-instance 
( that is, less than pW + k constraints in I can be simultaneously satisfied, where 
pW is the average weight of clauses satisfied by a random ordering ), then the 
number of variables in I is O(k). 

In order to prove this theorem some above-mentioned techniques were used. 
Let n = \V\. As for BETWEENNESS-AA (see Subsection 17721). Kim and Williams 
[39] introduced a random variable f(yi, . . . , y2n), which is a polynomial of de- 
gree 6 with 2n random uniformly distributed and independent variables yi, each 
taking value 1 or —1. The key property of f(yi, ■ ■ ■ , V2n) is that for every No- 
instance I we have f(yi, . . . ,y 2n ) < k for each (yi, . . . ,y 2n ) G {-1, l} 2n - 
In Subsection 17.21 a similar inequality was used to bound the number of con- 
straints in / using a probabilistic approach. Kim and Williams [39] use a differ- 
ent approach to bound the number of variables in /: they algebraically simplify 
f(yi, ■ ■ ■ , 2/2n) an d obtain its Fourier expansion (see Q). As in Subsection 15.31 
the Fourier expansion can be viewed as the excess of the corresponding instance 
of MAX-6-LIN2-AA. Thus, to bound the number of variables in the Fourier ex- 
pansion, we can use Theorem [8] (or, its weaker version obtained in 11391 ) which 
implies that the number is O(k). 

However, there was a major obstacle that Kim and Williams 091 had to 
overcome. In general case, as a result of the algebraic simplification, the num- 
ber of variables in the Fourier expansion may be significantly smaller than 2n 
and, thus, the bound on the number of variables in the Fourier expansion may 
not be used to bound n. To overcome the obstacle, Kim and Williams carefully 
analyzed the coefficients in the Fourier expansion and established that every 
variable of V is "represented" in the Fourier expansion. As a result, they con- 
cluded / can have only O(k) variables. 

Acknowledgments Research of Gutin was supported in part by the 1ST Pro- 
gramme of the European Community, under the PASCAL 2 Network of Excel- 
lence. Research of Gutin and Yeo was partially supported by an International 
Joint grant of Royal Society. 



30 



References 



1. N. Alon, Voting paradoxes and digraphs realizations, Advances in Applied Math. 29:126- 
135, 2002. 

2. N. Alon, G. Gutin, E. J. Kim, S. Szeider, and A. Yeo, Solving MAX-r-SAT above a tight 
lower bound. Algorithmica 61(3):638-655, 2011. 

3. N. Alon, G. Gutin and M. Krivelevich. Algorithms with large domination ratio, J. Algorithms 
50:118-131, 2004. 

4. J. Bang- Jensen and G. Gutin. Digraphs: Theory, Algorithms and Applications. Springer- 
Verlag, London, 2nd edition, 2009. 

5. H. L. Bodlaender, R.G. Downey, M.R. Fellows, and D. Hermelin, On problems without 
polynomial kernels. J. Comput. Syst. Sci. 75(8):423-434, 2009. 

6. H. L. Bodlaender, S. Thomasse, and A. Yeo, Kernel bounds for disjoint cycles and disjoint 
paths. Proc. ESA 2009, Lect. Notes Comput. Sci. 5757:635-646, 2009. 

7. A. Bonami, Etude des coefficients de Fourier des fonctions de L P (G). Ann. Inst. Fourier, 
20(2):335^02, 1970. 

8. M. Charikar, V. Guruswami, and R. Manokaran, Every permutation CSP of arity 3 is approx- 
imation resistant. Proc. Computational Complexity 2009, 62-73. 

9. C. Chen, I. Kanj and W. Jia, Vertex Cover: Further observations and further improvements. 
J. Algorithms, 41: 280-301, 2001. 

10. M. Chlebfk and J. Clebfkova, Crown reductions for the Minimum Weighted Vertex Cover 
problem. Discrete Appl. Math., 156: 292-312, 2008. 

11. B. Chor and M. Sudan. A geometric approach to betweenness. SIAM J. Discrete Math., 
11(4):51 1-523, 1998. 

12. R. Crowston, M. Fellows, G. Gutin, M. Jones, F. Rosamond, S. Thomasse and A. Yeo, Si- 
multaneously satisfying linear equations over F2 : MaxLin2 and Max-r-Lin2 parameterized 
above average. Tech. Report, arXiv: 1104. 1135 ^3, 2011. 

13. R. Crowston, G. Gutin, and M. Jones, Note on Max Lin-2 above average. Inform. Proc. Lett. 
110:451^154, 2010. 

14. R. Crowston, G. Gutin, M. Jones, E. J. Kim, and I. Ruzsa. Systems of linear equations over 
F2 and problems parameterized above average. Proc. SWAT 2010, Lect. Notes Comput. Sci. 
6139: 164-175, 2010. 

15. R. Crowston, G. Gutin, M. Jones, V. Raman, and S. Saurabh, Parameterized Complexity of 
MaxSat Above Average. Tech. Report, larXry: 1 108.450l1v l . 2011. 

16. R. Crowston, G. Gutin, M. Jones, and A. Yeo, A new lower bound on the maximum number 
of satisfied clauses in Max-SAT and its algorithmic applications. Algorithmica, to appear. 
Preminary version in Proc. oflPEC 2010, Lect. Notes Comput. Sci. 6478 (2010), 84-94. 

17. M. Cygan, M. Pilipczuk, M. Pilipczuk, and J.O. Wojtaszczyk, On Multiway Cut parameter- 
ized above lower bounds. Proc. 1PEC 2011, to appear. 

18. R. G. Downey and M. R. Fellows. Parameterized Complexity, Springer, 1999. 

19. H. Fernau, Parameterized Algorithmics: A Graph-theoretic Approach. Habilitation thesis, U. 
Tubingen, 2005. 

20. H. Femau, FV. Fomin, D. Lokshtanov, D. Raible, S. Saurabh, and Y. Villanger, Kernel(s) for 
problems with no kernel: On out-trees with many leaves. Proc. STACS 2009, 421^132. 

21. H. Fleischner, O. Kullmann and S. Szeider, Polynomial-time recognition of minimal unsat- 
isfiable formulas with fixed clause-variable difference. Theoret. Comput. Sci., 289(1):503- 
516, 2002. 

22. J. Flum and M. Grohe. Parameterized Complexity Theory, Springer, 2006. 

23. M. R. Garey and D. R. Johnson, Computers and Intractability, W.H. Freeman & Comp., 
New York, 1979. 



31 



24. G. Gottlob and S. Szeider, Fixed-parameter algorithms for artificial intelligence, constraint 
satisfaction, and database problems, Comput. J. 51(3): 303-325, 2008. 

25. V. Guruswami, J. Hastad, R. Manokaran, P. Raghavendra, and M. Charikar, Beating the 
random ordering is hard: Every ordering CSP is approximation resistant. Elec. Coll. Comput. 
Complex. 18:27, 2011. 

26. V. Guruswami, R. Manokaran, and P. Raghavendra, Beating the random ordering is hard: 
Inapproximability of maximum acyclic subgraph. Proc. FOCS 2008, 573-582. 

27. G. Gutin, L. van Iersel, M. Mnich, and A. Yeo, Every ternary permutation constraint satisfac- 
tion problem parameterized above average has a kernel with a quadratic number of variables, 
J. Comput. System Sci., in press, doi:10.1016/j.jcss.201 1.01.004. 

28. G. Gutin, M. Jones and A. Yeo, A New Bound for 3-Satisfiable MaxSat and its Algorithmic 
Application. Proc. FCT2011, Lect. Notes Comput. Sci. 6914: 138-147, 2011. 

29. G. Gutin, E. J. Kim, M. Mnich, and A. Yeo. Betweenness parameterized above tight lower 
bound. J. Comput. Syst. Sci., 76: 872-878, 2010. 

30. G. Gutin, E. J. Kim, S. Szeider, and A. Yeo. A probabilistic approach to problems parame- 
terized above tight lower bound. J. Comput. Syst. Sci. 77: 422-429, 201 1. 

31. G. Gutin and A. Yeo, Hypercontractive inequality for pseudo-Boolean functions of bounded 
Fourier width. Tech. Report, larXiv: 1 106. 1049^ 2, June 201 1. 

32. W. Guttmann and M. Maucher. Variations on an ordering theme with constraints. Proc. 
4th IFIP International Conference on Theoretical Computer Science-TCS 2006, pp. 77-90, 
Springer. 

33. J. Hastad, Some optimal inapproximability results. J. ACM 48: 798-859, 2001. 

34. J. Hastad and S. Venkatesh, On the advantage over a random assignment. Random Structures 
& Algorithms 25(2): 1 17-149, 2004. 

35. A. Isli and A.G. Cohn. A new approach to cyclic ordering of 2D orientations using ternary 
relation algebras. Artif. Intelligence, 122(1-2): 137-187, 2000. 

36. S. Jukna, Extremal Combinatorics With Applications in Computer Science, Springer- Verlag, 
2001. 

37. R.M. Karp, Reducibility among combinatorial problems, Proc. Complexity of Computer 
Computations, Plenum Press, 1972. 

38. S. Khot, On the power of unique 2-prover 1-round games. Proc. STOC 2002, 767-775. 

39. E.J. Kim and R. Williams, Improved parameterized algorithms for constraint satisfaction. 
Proc. IP EC 2011, to appear. 

40. M. Lampis, A kernel of order 2k — clog k for Vertex Cover. Manuscript, 201 1. 

41. K.J. Lieberherr and E. Specker, Complexity of partial satisfaction. J. ACM, 28(2):41 1-421, 
1981. 

42. K.J. Lieberherr and E. Specker, Complexity of partial satisfaction, II. Tech. Report 293, 
Dept. of EECS, Princeton Univ., 1982. 

43. D. Lokshtanov, New Methods in Parameterized Algorithms and Complexity, PhD thesis, 
Bergen, 2009. 

44. M. Mahajan and V. Raman. Parameterizing above guaranteed values: MaxSat and MaxCut. J. 
Algorithms, 31(2):335-354, 1999. Preliminary version in Electr. Colloq. Comput. Complex. 
(ECCC), TR-97-033, 1997. 

45. M. Mahajan, V. Raman, and S. Sikdar. Parameterizing above or below guaranteed values. 
J. Computer System Sciences, 75(2): 137-153, 2009. Preliminary version in Proc. IWPEC 
2006, Lect. Notes Comput. Sci. 4169: 38-49, 2006. 

46. S. Mishra, V. Raman, S. Saurabh, S. Sikdar and C.R. Subramanian, The complexity of Konig 
subgraph problems and above-guarantee Vertex Cover. Algorithmica, DOI: 10.1007/s00453- 
010-9412-2. 



32 



47. G.L. Nemhauser and L.E. Trotter. Vertex packings: structural properties and algorithms. 
Math. Programming, 8(l):232-248, 1975. 

48. R. Niedermeier. Invitation to Fixed-Parameter Algorithms. Oxford University Press, 2006. 

49. R. O'Donnell, Some topics in analysis of Boolean functions. Technical re- 
port, ECCC Report TR08-055, 2008. Paper for an invited talk at STOC'08, 
www.eccc.uni-trier.de/eccc-reports/2008/TR08-055/ . 

50. J. Opatrny, Total ordering problem. SIAM J. Comput., 8: 1 1 1-1 14, 1979. 

51. V. Raman, M.S. Ramanujan and S. Saurabh, Paths, Flowers and Vertex Cover. In: Proc. ESA 
201 1, to appear. 

52. I. Razgon and B. O' Sullivan. Almost 2-SAT is fixed-parameter tractable. J. Comput. Syst. 
Sci. 75(8):435~450, 2009. 

53. G. Reinelt, The linear ordering problem: Algorithms and applications, Heldermann Verlag, 
1985. 

54. A. Soleimanfallah and A. Yeo, A kernel of order 2k — c for Vertex Cover. Discrete Math., 
311(10-ll):892-895, 2011. 

55. S. Szeider, Minimal unsatisfiable formulas with bounded clause-variable difference are 
fixed-parameter tractable. J. Comput. Syst. Sci., 69(4):656-674, 2004. 

56. M. Yannakakis, On the approximation of maximum satisfiability. J. Algorithms, 17:475-502, 
1994. 



33 



